Feasibility of Human-in-the-loop Minimum Error Rate Training

نویسندگان

  • Omar Zaidan
  • Chris Callison-Burch
چکیده

Minimum error rate training (MERT) involves choosing parameter values for a machine translation (MT) system that maximize performance on a tuning set as measured by an automatic evaluation metric, such as BLEU. The method is best when the system will eventually be evaluated using the same metric, but in reality, most MT evaluations have a human-based component. Although performing MERT with a human-based metric seems like a daunting task, we describe a new metric, RYPT, which takes human judgments into account, but only requires human input to build a database that can be reused over and over again, hence eliminating the need for human input at tuning time. In this investigative study, we analyze the diversity (or lack thereof) of the candidates produced during MERT, we describe how this redundancy can be used to our advantage, and show that RYPT is a better predictor of translation quality than BLEU.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Error assessment in man-machine systems using the CREAM method and human-in-the-loop fault tree analysis

Background and Objectives: Despite contribution to catastrophic accidents, human errors have been generally ignored in the design of human-machine (HM) systems and the determination of the level of automation (LOA). This paper aims to develop a method to estimate the level of automation in the early stage of the design phase considering both human and machine performance. Methods: A quantita...

متن کامل

Potential Effects of Climatic Parameters on Human Brucellosis in Fars Province, Iran, during 2009-2015

Background: Human brucellosis is widespread in Fars province. The present study aimed to investigate the effect of climate on its incidence and determine the areas prone to the infection.Methods: Monthly meteorological data and the incidence rate of human brucellosis during 2009-2015 were collected and their correlation was studied using Pearson’s correlation coefficient. Additionally, th...

متن کامل

Covariance Analysis of a vector tracking GPS receiver based on MMSE multiuser Detection

In high dynamic conditions, using vector tracking loops instead of scalar tracking loops in GPS receivers is proved as an efficient method to compensate the performance. The Minimum Mean Squared Error detector as a multiuser detector is applied in the vector tracking loop for more reliability and efficiency. The Kalman filter does the two tasks of tracking and extracting the navigation data aft...

متن کامل

Evaluation of Human Reliability by Standardized Plant Analysis Risk HRA (SPAR-H) method in the Dialysis Process in Ebne Sina Hospital, Shiraz

Background and Objectives: Human errors in dialysis care can cause injury and death. One of the basic steps to increase reliability in this critical process is to analyze the error and identify the weaknesses of doing this process. Methods: The present study is a descriptive-analytic cross-sectional study. The SPAR-H method was used to identify and evaluate the probability of human error in th...

متن کامل

Investigation of human error by using THERP method in control room of incoiler department in a pipe manufacturing company

Background & Aims of the Study: Today, in many sensitive occupational environments, human error can lead to catastrophic events. Given that the sensitive task of a control area operator, which in the occurrence of malfunction or failure leads to irreparable events, it is important to predict human errors to reduce its adverse consequences. Therefore, the present study was  perform by aiming to ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009